(ignore at your own risk)
.ipynb file.All the necessary data sets can be downloaded from the following Google Drive folder: https://drive.google.com/drive/folders/187ekqZ8FSeQF9dCKV66eMXEBnLIlaqqF?usp=drive_link
However, it is recommended that you use the Raw URL address below since one of the data sets is quite large.
mystery_data.csv:
dopamine_reaction_time.csv:
insta.csv
While navigating the shadowy lower levels of the CAB catacombs in
search of caffeine, an intrepid student archaeologist stumbled upon a
dusty, long-forgotten treasure chest. Inside: a tattered file of data
called mystery_data.csv, sealed in wax and covered in
cryptic statistical runes.
Your objectives:
Use ggplot2 to create a bar plot that clearly conveys differences between conditions in the data, if any.
Add error bars representing one standard error above and below the classic mean (i.e., not a trimmed or robust version).
Inside a data frame, display (at a minimum) the following statistics for each condition:
Repeat question 1, but use 10% trimming and have the error bars represent two-sided 99% confidence intervals. Inside your data frame, make sure to also include the confidence interval’s top and bottom boundaries for each condition.
Using the boxplot rule for outlier detection, list the amount of
outliers found for condition “I” in mystery_data.csv. You
are not permitted to use the functions quantile() or
IQR().
A psychologist studying expert performance under high-pressure conditions is examining how consistently elite athletes deliver when the stakes are highest. One of the most electrifying players in modern hockey, Connor McDavid 😍 of the Edmonton Oilers, recorded an astonishing 153 points over 82 games during the 2022–2023 NHL season, averaging approximately 1.87 points per game.
Suppose McDavid’s per-game scoring follows a normal distribution with a mean of 1.87 points and a standard deviation of 0.55 points.
Now imagine this psychologist is watching from the stands at Rogers Place, during a crucial playoff game, wondering if McDavid will rise to the occasion once again.
Using this model, what is the probability that McDavid scores more than 2.295 points in a randomly selected game?
From the previous question, what is the probability that McDavid scores between 1 and 2 points in a randomly selected game?
In an informal study of Oilers fans’ playoff rituals, a local sports psychologist surveyed fans outside Rogers Place to see how many lucky items they bring with them to each playoff game. These included things like lucky jerseys, special pucks, autographed McDavid photos, and even one person’s mysterious “Victory Pierogi.”
The distribution of “lucky item counts” brought to the rink is shown in the histogram below.
Calculate the mean number of lucky items brought by fans to the game based on the histogram.
A behavioral neuroscientist is interested in how a dopamine agonist affects motor response speed. Participants were randomly assigned to receive either a placebo or a low dose of a dopamine agonist before performing a computerized simple reaction time task. In each trial, a visual cue appeared at a random interval, and participants were instructed to press a key as quickly as possible in response.
The researcher computed each participant’s mean reaction time (in
milliseconds) across trials. They hypothesized that the dopamine
condition would result in faster overall reaction times. The
data can be found in dopamine_reaction_time.csv. Conduct an
appropriate statistical test that evaluates whether or not the
researcher’s hypothesis should be accepted.
Use of t.test() and yuen() is not
permitted.
In your output please report the following:
The null and alternative hypothesis you tested
The test statistic
Degrees of freedom
p-value
95% Confidence interval
Your conclusion
A recent exposé revealed that the Instagram app has been secretly logging private usage data from its users and transmitting it to its parent company, Meta. Following the data leak, major news outlets reported that Zoomers (members of Gen Z between the ages of 13 and 28) spend an average of 6 hours per day on their smartphones.
You manage to obtain a random sample of screen time data from
Zoomers. Using this sample, conduct an appropriate test that
evaluates whether the media’s claim of a 6-hour average should be
rejected. The data is in insta.csv. Use of
t.test() is not permitted.
Clearly state your null and alternative hypotheses.
Report the test statistic, p-value, 95% Confidence Interval.
Considering the values obtained in the previous question’s analysis, do you believe the media’s claim is unreasonable? Why? (No calculation is required for this question)
Suppose you record the number of times a neuron fires per second in response to a stimulus. What scale of measurement does this represent?
What scale of measurement would differences between rates of neuron firing be?
You measure the amount of dopamine (in nanograms per milliliter) present in a rat’s nucleus accumbens after exposure to a drug. What scale of measurement is this?
You group EEG data based on whether the participant was in a resting, task, or sleep condition. What scale of measurement is this?
A researcher counts how many milliseconds it takes participants to identify an image of a fearful face. What scale of measurement is used?
Subjects rate their pain during a mild electric shock using a 5-point scale from ‘no pain’ to ‘extreme pain’. What kind of measurement scale is this?
A neuroscientist records brain temperature (in Celsius) before and after a task. What scale of measurement does this involve?
Participants are asked to rank a series of images based on how emotionally disturbing they find them, from least to most disturbing. What scale of measurement is this?
In a memory experiment, participants are categorized based on the brain region that showed the most activation (e.g., hippocampus, amygdala, prefrontal cortex). What scale of measurement does this reflect?